NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

QUESTO: Interactive Construction of Objective Functions for Classification Tasks

https://doi.org/10.1111/cgf.13970

Das, Subhajit; Xu, Shenyu; Gleicher, Michael; Chang, Remco; Endert, Alex (June 2020, Computer Graphics Forum)
null (Ed.)
Full Text Available
CAVA: A Visual Analytics System for Exploratory Columnar Data Augmentation Using Knowledge Graphs

https://doi.org/10.1109/TVCG.2020.3030443

Cashman, Dylan; Xu, Shenyu; Das, Subhajit; Heimerl, Florian; Liu, Cong; Humayoun, Shah Rukh; Gleicher, Michael; Endert, Alex; Chang, Remco (January 2021, IEEE Transactions on Visualization and Computer Graphics)
null (Ed.)
Most visual analytics systems assume that all foraging for data happens before the analytics process; once analysis begins, the set of data attributes considered is fixed. Such separation of data construction from analysis precludes iteration that can enable foraging informed by the needs that arise in-situ during the analysis. The separation of the foraging loop from the data analysis tasks can limit the pace and scope of analysis. In this paper, we present CAVA, a system that integrates data curation and data augmentation with the traditional data exploration and analysis tasks, enabling information foraging in-situ during analysis. Identifying attributes to add to the dataset is difficult because it requires human knowledge to determine which available attributes will be helpful for the ensuing analytical tasks. CAVA crawls knowledge graphs to provide users with a a broad set of attributes drawn from external data to choose from. Users can then specify complex operations on knowledge graphs to construct additional attributes. CAVA shows how visual analytics can help users forage for attributes by letting users visually explore the set of available data, and by serving as an interface for query construction. It also provides visualizations of the knowledge graph itself to help users understand complex joins such as multi-hop aggregations. We assess the ability of our system to enable users to perform complex data combinations without programming in a user study over two datasets. We then demonstrate the generalizability of CAVA through two additional usage scenarios. The results of the evaluation confirm that CAVA is effective in helping the user perform data foraging that leads to improved analysis outcomes, and offer evidence in support of integrating data augmentation as a part of the visual analytics pipeline.
more » « less
Full Text Available
Geono-Cluster: Interactive Visual Cluster Analysis for Biologists

https://doi.org/10.1109/TVCG.2020.3002166

Das, Subhajit; Saket, Bahador; Kwon, Bum Chul; Endert, Alex (January 2020, IEEE Transactions on Visualization and Computer Graphics)
null (Ed.)
Full Text Available
A User-based Visual Analytics Workflow for Exploratory Model Analysis

https://doi.org/10.1111/cgf.13681

Cashman, Dylan; Humayoun, Shah Rukh; Heimerl, Florian; Park, Kendall; Das, Subhajit; Thompson, John; Saket, Bahador; Mosca, Abigail; Stasko, John and; Gleicher, Michael; et al (June 2019, Computer graphics forum)

Many visual analytics systems allow users to interact with machine learning models towards the goals of data exploration and insight generation on a given dataset. However, in some situations, insights may be less important than the production of an accurate predictive model for future use. In that case, users are more interested in generating of diverse and robust predictive models, verifying their performance on holdout data, and selecting the most suitable model for their usage scenario. In this paper, we consider the concept of Exploratory Model Analysis (EMA), which is defined as the process of discovering and selecting relevant models that can be used to make predictions on a data source. We delineate the differences between EMA and the well‐known term exploratory data analysis in terms of the desired outcome of the analytic process: insights into the data or a set of deployable models. The contributions of this work are a visual analytics system workflow for EMA, a user study, and two use cases validating the effectiveness of the workflow. We found that our system workflow enabled users to generate complex models, to assess them for various qualities, and to select the most relevant model for their task.
more » « less
Full Text Available

Search for: All records